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On page 6, line 19, please delete "allowed U.S. Patent Application Serial No. 
08/479,328" and insert --U.S. Patent 5,698,396- therefor. 
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On page 10, line 20, please delete "breain" and insert -brain- therefor. 
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In the Claims 

Please cancel without prejudice claims 1-5, 7-36 and 41-56. 

Please amend the claims as follows: 
6.(amended) An [I]isolated protein encoded by [the] an isolated nucleic acid [of claim 1] 
molecule selected from the group consisting of: 

(a) nucleic acid molecules which encode a cancer associated antigen, and which comprise 
a nucleotide sequence, the complementary sequence of which hybridizes, under stringent 
conditions, to at least one second nucleic acid molecule comprising a nucleotide sequence 
selected from the group consisting of the nucleotide sequences set forth as SEQ ID NOs: 1, 2, 3, 
4, and 5, 

(b) nucleic acid molecules that differ from the nucleic acid molecules of (a) in codon 
sequence due to the degeneracy of the genetic code, and 

(c) complements of (a) or (b) . 

37.(amended) A composition of matter useful in stimulating an immune response to a t least one 
protein encoded by at least one nucleic acid molecule comprising a nucleotide sequence set 
forth in SEQ ID NO: 1, 2, 3, 4 or 5, said composition comprising a plurality of peptides 
derived from the amino acid sequence of said at least one protein, wherein said peptides bind 
to one or more MHC molecules presented on the surface of cells which express an abnormal 
amount of said at least one protein. 

Please add the following new claims: 

57. The isolated protein of claim 6, wherein the nucleic acid molecule comprises SEQ ID 
NO:l. 

58. The isolated protein of claim 6, wherein the nucleic acid molecule comprises SEQ ID 
NO:2. 

59. The isolated protein of claim 6, wherein the nucleic acid molecule comprises SEQ ID 
NO:3. 
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60. The isolated protein of claim 6, wherein the nucleic acid molecule comprises SEQ ID 
NO:4. 

61. The isolated protein of claim 6, wherein the nucleic acid molecule comprises SEQ ID 
NO:5. 

62. The composition of matter of claim 37, wherein the at least one nucleic acid molecule 
comprises the nucleotide sequence set forth in SEQ ID NO:l. 

63. The composition of matter of claim 37, wherein the at least one nucleic acid molecule 
comprises the nucleotide sequence set forth in SEQ ID NO:2. 

64. The composition of matter of claim 37, wherein the at least one nucleic acid molecule 
comprises the nucleotide sequence set forth in SEQ ID NO: 3. 

65. The composition of matter of claim 37, wherein the at least one nucleic acid molecule 
comprises the nucleotide sequence set forth in SEQ ID NO:4. 

66. The composition of matter of claim 37, wherein the at least one nucleic acid molecule 
comprises the nucleotide sequence set forth in SEQ ID NO: 5. 

67. The composition of matter of claim 37, wherein at least one of the plurality of peptides 
is coupled to an immune response stimulating compound. 

REMARKS 

Please enter this Preliminary Amendment before calculating the fees. Applicants have 
deleted all claims but those corresponding to claim group number 2 (claims 6 and 37-40) as set 
forth in the restriction requirement mailed on October 1, 1998 in the parent case, serial number 
08/948,705. Amendments to the Specification are the same as made in the parent application. 



- 4 - 



New claims 57-61 and 62-66 are derived from original claims 6 and 37, respectively. Support 
for new claim 67 can be found in the Specification at page 14, line 16. No new matter has been 
added. 



Respectfully submitted, 

JoM R. Van Amsterdam 
Reg- No. 40,212 
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Boston, MA 02210 
Tel. (617)720-3500 
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FIELD OF THE INVENTION 

This invention relates to the isolation of genes associated with colon cancer, methods 
of diagnosing colon cancer using these, as well as other genes which are known, as well as 
therapeutic approaches to treating such conditions. 

BACKGROUND AND PRIOR ART 

It is fairly well established that many pathological conditions, such as infections, 
cancer, autoimmune disorders, etc., are characterized by the inappropriate expression of 
certain molecules. These molecules thus serve as "markers" for a particular pathological or 
abnormal condition. Apart from their use as diagnostic "targets", i.e., materials to be 
identified to diagnose these abnormal conditions, the molecules serve as reagents which can 
be used to generate diagnostic and/or therapeutic agents. A by no means limiting example of 
this is the use of cancer markers to produce antibodies specific to a particular marker. Yet 
another non-limiting example is the use of a peptide which complexes with an MHC 
molecule, to generate cytolytic T cells against abnormal cells. 

Preparation of such materials, of course, presupposes a source of the reagents used to 
generate these. Purification from cells is one laborious, far from sure method of doing so. 
Another preferred method is the isolation of nucleic acid molecules which encode a 
particular marker, followed by the use of the isolated encoding molecule to express the 
desired molecule. 

To date, two strategies have been employed for the detection of such antigens, in 
e.g., human tumors. These will be referred to as the genetic approach and the biochemical 
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approach. The genetic approach is exemplified by, e.g., dePlaen et aL, Proc. Natl. Sci. 
USA 85: 2275 (1988), incorporated by reference. In this approach, several hundred pools of 
plasmids of a cDNA library obtained from a tumor are transfected into recipient cells, such 
as COS cells, or into antigen-negative variants of tumor cell lines. Transfectants are 
screened for the expression of tumor antigens via their ability to provoke reactions by anti- 
tumor cytolytic T cell clones. The biochemical approach, exemplified by, e.g., 
Mandelboim, et aL, Nature 369: 69 (1994) incorporated by reference, is based on acidic 
elution of peptides which have bound to MHC-class I molecules of tumor cells, followed by 
reversed-phase high performance liquid chromography (HPLC). Antigenic peptides are 
identified after they bind to empty MHC-class I molecules of mutant cell lines, defective in 
antigen processing, and induce specific reactions with cytolytic T-lymphocytes ("CTLs"). 
These reactions include induction of CTL proliferation, TNF release, and lysis of target 
cells, measurable in an MTT assay, or a 5I Cr release assay . 

These two approaches to the molecular definition of antigens have the following 
disadvantages: first, they are enormously cumbersome, time-consuming and expensive; 
second, they depend on the establishment of CTLs with predefined specificity; and third, 
their relevance in vivo for the course of the pathology of disease in question has not been 
proven, as the respective CTLs can be obtained not only from patients with the respective 
disease, but also from healthy individuals, depending on their T cell repertoire. 

The problems inherent to the two known approaches for the identification and 
molecular definition of antigens is best demonstrated by the fact that both methods have, so 
far, succeeded in defining only very few new antigens in human tumors. See, e.g., van der 
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Braggen et al., Science 254: 1643-1647 (1991); Brichard et aL, J. Exp. Med. 178: 489-495 
(1993); Coulie, et al., J. Exp. Med. 180: 35-42 (1994); Kawakami, et al., Proc. Natl. Acad. 
Sci. USA 91: 3515-3519 (1994). 

Further, the methodologies described rely on the availability of established, 
permanent cell lines of the cancer type under consideration. It is very difficult to establish 
cell lines from certain cancer types, as is shown by, e.g., Oettgen, et al., Immunol. Allerg. 
Clin. North. Am. 10: 607-637 (1990). It is also known that some epithelial cell type cancers 
are poorly susceptible to CTLs in vitro, precluding routine analysis. These problems have 
stimulated the art to develop additional methodologies for identifying cancer associated 
antigens. 

One key methodology is described by Sahin, et al., Proc. Natl. Acad. Sci. USA 92: 
11810-11913 (1995), incorporated by reference. Also, see U.S. Patent Applications Serial 
No. 08/580,980, and Application Serial No. 08/479,328, filed on June 7, 1995 and January 
3, 1996, respectively. All three of these references are incorporated by reference. To 
summarize, the method involves the expression of cDNA libraries in a prokaryotic host. 
(The libraries are secured from a tumor sample). The expressed libraries are then 
immunoscreened with absorbed and diluted sera, in order to detect those antigens which 
elicit high titer humoral responses. This methodology is known as the SEREX method 
("Serological identification of antigens by Recombinant Expression Cloning"). The 
methodology has been employed to confirm expression of previously identified tumor 
associated antigens, as well as to detect new ones. See the above referenced patent 
applications and Sahin, et al., supra , as well as Crew, et al., EMBO J 144: 2333-2340 
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(1995). 

The SEREX methodology has now been applied to colon cancer samples. Several 
nucleic acid molecules have been newly isolated and sequenced, and are now associated with 
stomach cancer. Further, a pattern of expression involving these, as well as previously 
isolated genes has been found to be associated with colon cancer. These results are the 
subject of this application, which is elaborated upon in the disclosure which follows. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
Example 1 

Tumor samples were obtained as surgical samples, and were frozen at -80 °C until 
ready for use. 

Total RNA was then isolated from the samples, using the well known guanidium 
thiocyanate method of Chirgwin, et al., Biochemistry 18: 5294-5299 (1979), incorporated by 
reference. The thus obtained total RNA was then purified to isolate all poly A + RNA, using 
commercially available products designed for this purpose. 

The poly A + RNA was then converted into cDNA, and ligated into XZAP, a well 
known expression vector. 

Three cDNA libraries were constructed in this way, using colorectal carcinoma 
samples. A fourth library, also from colorectal carcinoma, was prepared, albeit in a 
different way. The reasons for this difference will be clear in the examples, infra. 

The fourth library was an IgG subtraction library, prepared by using a subtraction 
partner, generated by PCR amplification of a cDNA clone which encoded an IgG molecule. 
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See, e.g., Ace et al, Endocrinology 134: 1305-1309 (1994), and incorporated by reference in 
its entirety. 

This is done to eliminate any false, positive signals resulting from interaction of 
cDNA clones which encode IgG, with the IgG then interacting with the anti-human IgG used 

5 in the assay, as described infra . PCR product was biotinylated, and hybridized with 

denatured second strand cDNA, at 68°C for 18 hours. Biotinylated hybrid molecules were 
coupled to streptavidin, and then removed by phenol chloroform extraction. Any remaining 
cDNA was also ligated into XZAP. All libraries were amplified, prior to immunoscreening 

ry discussed infra. 

W Example 2 

75 Immunoscreening was carried out, using sera obtained from patients undergoing 

U routine diagnostic and therapeutic procedures. The sera were stored at -70°C prior to use. 

-L-- Upon thawing, the sera were diluted at 1:10 in Tris buffered saline (pH 7.5), and were then 

G passed through Sepharose 4B columns. First, the sera were passed through columns which 

15 had E. coli Y1090 lysates coupled thereto, and then lysates from bacteriophage infected E. 

coli BNN97 lysates. Final serum dilutions were then prepared in 0.2% non-fat dried 

milk/Tris buffered saline. 

The method of Sahin et al., Proc. Natl. Acad. Sci. USA 92: 11810-11813 (1995), 

and allowed U.S. patent application Serial No. 08/479,328, both of which are incorporated 
20 by reference, was used, with some modifications. Specifically recombinant phage at a 

concentration of 4X10 3 phages per 15 cm plate (pfus), were amplified for six hours, after 

-6- 
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which they were transferred to nitrocellulose membranes for 15 hours. Then, the 
membranes were blocked with 5% nonfat dried milk. 

As an alternative to the IgG subtraction, discussed supra, membranes were 
prescreened in a 1:2000 dilution of peroxidase conjugated, Fc fragment specific goat anti- 
human IgG, for one hour, at room temperature. Color was developed using 3,3- 
diaminobenzidine tetrahydrochloride, which permitted scoring of IgG encoding clones. 

Membranes were then incubated in 1:100 dilutions of autologous sera, which had 
been pretreated with the Sepharose 4B columns, as described supra . The filters were then 
incubated, in a 1:3000 dilution of alkaline phosphatase conjugated Fc fragment specific, goat 
anti-human IgG, for one hour, at room temperature. The indicator system 4-nitroblue 
tetrazolium chloride/5-bromo-4-chloro-3-indolyl-phosphate was then added, and color 
development assessed. Any positive clones were subcloned, and retested, except the tine on 
the nitrocellulose membrane was reduced to three hours. A total of forty-eight positive 
clones were identified. 

Analysis of probes for SEQ ID NOS: 1 and 2 confirmed their universal expression. 

Example 3 

Example 2 described work using autologous serum. The positive clones were then 
rescreened, using allogeneic serum, following the same method discussed supra, in example 
2, except IgG prescreening was omitted. The allogeneic sera was obtained from sixteen 
normal blood donors, and twenty nine patients who had been diagnosed with colorectal 
cancer. 
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The analysis with the two types of serum revealed that fourteen reacted with a subset 
of sera from normal and cancer patients, twenty-eight only with autologous sera, and six 
with both allogeneic and autologous sera. Over 60% of the allogeneic serum samples tested 
reacted with at least one of these positive clones. About 20% reacted with two or more. 



Example 4 

In view of the results described in example 3, further experiments were carried out 
using serum samples from patients with other forms of cancer, i.e., renal cancer (13 > 
samples), lung cancer (23 samples), and breast cancer (10 samples). The results are set 
forth in Table I which follow: 



Clone Number 


Normal Sera 


Colon 


Renal 


Lung 


Breast 






Cancer 


Cancer 


Cancer 


Cancer 


NY-Co-8 


0/16 


8/29 


1/13 


0/23 


0/10 


NY-Co-9 


0/16 


5/29 


1/13 


1/23 


0/10 


NY-Co- 1 3 


0/16 


5/29 


0/13 


0/23 


0/10 


NY-Co- 16 


0/16 


3/29 


0/13 


0/23 


0/10 


NY-Co-20 


0/16 


4/29 


0/13 


0/23 


0/10 


NY-Co-38 


0/16 


4/29 


3/13 


0/23 


1/10 
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Example 5 

Following the screening work described supra , the cDNA inserts were purified and 
sequenced, following standard methods. 

Of the six clones which were identified as being reactive with autologous and 
allogeneic cancer serum, and not with normal serum, two were found to be identical to 
previously identified molecules. Four others were found to have little or no homology to 
known sequences. These are presented as SEQ ID NOS: 1-4. Of twenty seven allogeneic 
colon cancer serum samples tested, 67% reacted with at least one of these antigens. 

Example 6 

The expression pattern of mRNA corresponding to SEQ ID NOS: 1, 2 and 4, as well 
as other sequences identified via the preceding examples was determined. To do this, RT- 
PCR was carried out on a panel of RNA samples, taken from normal tissue. The panel 
contained RNA of lung, testis, small intestine, colon, breast, liver and placenta tissues. The 
RNA was purchased from a commercial source. RNA from a colon tumor sample was also 
included. All samples were set up for duplicate runs, so that genomic DNA contamination 
could be accounted for. In the controls, no reverse transcriptase was used. 

Primers were designed which were specific for the cDNA, which would amplify 5*- 
fragments, from 300-400 base pairs in length. The PCR reactions were undertaken at an 
annealing temperature of 68 °C. Where appropriate, 5' and 3'-RACE reactions were 
undertaken, using gene specific primers, and adapter primers, together with commercially 
available reagents. Specifically, SEQ ID NOS: 2 and 4 were tested using RACE. The 
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resulting products were subcloned into vector pCR 2.1, screening via PCR using internal 
primers, and then sequenced. 

SEQ ID NOS: 1 and 2 were found to be amplified in all tissues tested. SEQ ID NO: 
4 was found in colon tumor, colon metastasis, gastric cancer, renal cancer and colon cancer 
cell lines Colo 204 and HT29, as well as in normal colon, small intestine, brain, stomach, 
testis, pancreas, liver, lung, heart, fetal brain, mammary gland, bladder, adrenal gland 
tissues. It is was not found in normal uterine, skeletal muscle, peripheral blood 
lymphocytes, placental, spleen thymus, or esophagus tissue, nor in lung cancer. 

The analysis also identified differential expression of a splice variant of SEQ ID NO: 
4, i.e., SEQ ID NO: 5. When the two sequences were compared, it was found that SEQ ID 
NO: 4 encodes a putative protein of 652 amino acids, and molecular weight of 73,337 
daltons. SEQ ID NO: 5, in contrast, lacks an internal 74 base pairs, corresponding to 
nucleotides 1307-1380 of SEQ ID NO: 4. The deletion results in formation of a stop codon 
at the splice function, and a putative protein of 404 amino acids, and molecular weight 
45,839. The missing segment results in the putative protein lacking a PEST protein 
degradation sequence, thereby suggesting a longer half life for this protein. 

In additional experiments, primers designed not to differentiate between SEQ ID 
NOS: 4 and 5 resulted in almost universal amplification (placenta being the only exception). 
In contrast, when primers specific for SEQ ID NO: 5 were used differences were seen in 
normal pancreatic, liver, lung, heart, fetal breain, mammary gland, bladder, and adrenal 
gland tissue, where there was no expression of SEQ ID NO: 5 found. 
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Example 7 

Northern blotting was also carried out for SEQ ID NOS: 1, 2, 4 and 5. To do this, 
the same commercially available RNA libraries discussed supra were used. 

Samples (2 ug) of polyA + RNA were analyzed from these samples, using random, 32 P 
labelled probes 300-360 nucleotides in length, obtained from PGR products. These probes 
were hybridized to the RNA, for 1.5 hours, at 68°C, followed by two washes at O.lxSSC, 
0.1% SDS, 68°C, for 30 minutes each time. 

SEQ ID NOS: 1 and 2 were again found to be universally expressed. 

Example 8 

Further screening identified additional isoforms of SEQ ID NOS: 1 and 4. These are 
set forth as SEQ ID NOS: 6, 7 and 8. The isoform represented by SEQ ID NO: 6 is a 
naturally occurring splice variant of SEQ ID NO: 1, found in normal colon. SEQ ID NO: 
7, which is an isoform of SEQ ID NO: 4, was found in brain tissue, primarily spinal chord 
and medulla. SEQ ID NO: 8, was found in normal kidney and in colon tumors, 
metastasized colon cancer, gastric cancer, and in colon cancer cell line Colo 205. It was not 
found in any normal tissue other than kidney. 

The foregoing examples demonstrate several features of the invention. These include 
diagnostic methods for determining presence of transformed cells, such as colon cancer cells, 
in a sample. The sample may contain whole cells or it may be, e.g., a body fluid sample, 
or an effusion, etc., where the sample may contain cells, but generally will contain shed 
antigen. The experiments indicate that there is a family of proteins, expression of which is 

-11- 
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associated with colon cancer. Hence, the invention involves, inter alia, detecting at least two 
of the proteins encoded by any of SEQ ID NOS: 1-5 wherein, presence of these is indicative 
of a pathology, such as colon cancer or other type of related condition. Exemplary of the 
type of diagnostic assays which can be carried out are immunoassays, amplification assays 
(e.g., PCR), or, what will be referred to herein as a "display array". "Display array" as 
used herein refers to a depiction of the protein profile of a given sample. Exemplary of such 
displays are 2-dimensional electrophoresis, banding patterns such as SDS-gels, and so forth. 
Thus, one aspect of the invention involves diagnosing colon cancer or a related condition by 
determining protein display of a sample, wherein a determination of at least one of the 
proteins, or expression of their genes, is indicative of colon cancer or a related condition. 
There are many ways to carry out these assays. For example, as indicated herein, antibodies 
to the proteins were found in patient samples. One can assay for these antibodies using, 
e.g., the methodology described herein, or by using a purified protein or proteins or 
antigenic fragment thereof, and so forth. One can also assay for the protein itself, using 
antibodies, which may be isolated from samples, or generated using the protein and standard 
techniques. This antibodies can then be labelled, if desired, and used in standard 
immunoassays. These antibodies or oligonucleotide probes/primers may also be used to 
examine biopsied tissue samples, e.g., to diagnose precancerous conditions, early stage 
cancers, and so forth. 

Similarly, any and all nucleic acid hybridization systems can be used, including 
amplification assays, such as PCR, basic probe hybridization assays, and so forth. The 
antibodies, such as polyclonal antibodies, monoclonal antibodies, the hybridomas which 

-12- 
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produce them, recombinantly produced antibodies, binding fragments of these, hybridization 
kits, DNA probes, and so forth, are all additional features of the invention. 

Any of these assays can also be used in progression/regression studies. One can 
monitor the course of an abnormality such as colon cancer which involve expression of any 
one of the proteins, the expression of which is governed by the nucleic acid molecules SEQ 
ID NOS: 1-5, simply by monitoring levels of the protein, its expression, and so forth using 
any or all of the methods set forth supra . 

As has been indicated supra , the isolated nucleic acid molecules which comprise the 
nucleotide sequences set forth in SEQ ID NOS: 1-5 are new, in that they have never been 
isolated before. These nucleic acid molecules may be used as a source to generate colon 
cancer specific proteins and peptides derived therefrom, and oligonucleotide probes which 
can themselves be used to detect expression of these genes. Hence, a further aspect of the 
invention is an isolated nucleic acid molecule which comprises any of the nucleotide 
sequences set forth in SEQ ID NOS: 1-5, or molecules whose complements hybridize to one 
or more of these nucleotide sequences, under stringent conditions, expression vectors 
comprising these molecules, operatively linked to promoters, cell lines and strains 
transformed or transfected with these, and so forth. "Stringent conditions", is used herein, 
refers to condition such as those specified in U.S. Patent No. 5,342,774, i.e., 18 hours of 
hybridization at 65°C, followed by four one hour washes at 2xSSC, 0.1% SDS, and a final 
wash at 0.2xSSC, more preferably O.lxSSC, 0.1% SDS for 30 minutes, as well as alternate 
conditions which afford the same level of stringency, and more stringent conditions. 

It should be clear that these methodologies may also be used to track the efficacy of a 
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therapeutic regime. Essentially, one can take a baseline value for the protein or proteins 
being tested, using any of the assays discussed supra, administer a given therapeutic, and 
then monitor levels of the protein or proteins thereafter, observing changes in protein levels 
as indicia of the efficacy of the regime. 

The identification of the proteins and nucleic acid molecules set forth herein as being 
implicated in pathological conditions such as colon cancer also suggests a number of 
therapeutic approaches to such conditions. The experiments set forth supra establish that 
antibodies are produced in response to expression of these proteins, suggesting their use as a 
vaccine. Hence, a further embodiment of the invention is the treatment of conditions which 
are characterized by expression of one or more of the subject proteins, via 
immunotherapeutic approaches. One of these approaches is the administration of an amount 
of one or more these proteins, or an immunogenic peptide derived therefrom in an amount 
sufficient to provoke or augment an immune response. The proteins or peptides may be 
combined with one or more of the known immune adjuvants, such as saponins GM-CSF 
interleukins, and so forth. If the peptides are too small to generate a sufficient antibody 
response, they can be coupled to the well known conjugates used to stimulate responses. 

Similarly, the immunotherapeutic approaches include administering an amount of 
inhibiting antibodies sufficient to inhibit the protein or proteins. These antibodies may be, 
e.g., antibodies produced via any of the standard approaches elaborated upon supra. 

T cell responses may also be elicited by using peptides derived from the proteins 
which then complex, non-covalently, with MHC molecules, thereby stimulating proliferation 
of cytolytic T cells against any such complexes in the subject. It is to be noted that the T 
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cells may also be elicited in vitro, and then reperfused into the subject being treated. 

Note that the generation of T cells and/or antibodies can also be accomplished by 
administering cells, preferably treated to be rendered non-proliferative, which present 
relevant T cell or B cell epitopes for response. 
5 The therapeutic approaches may also include gene therapies, wherein an antisense 

molecule, preferably from 10 to 100 nucleotides in length, is administered to the subject 
either "neat 1 ' or in a carrier, such as a liposome, to facilitate incorporation into a cell, 
followed by inhibition of expression of the protein. Such antisense sequences may also be 
incorporated into appropriate vaccines, such as in viral vectors (e.g., Vaccinia), bacterial 

I© constructs, such as variants of the well known BCG vaccine, and so forth. 

fy An additional DNA based therapeutic approach is the use of a vector which comprises 

one or more nucleotide sequences, preferably a plurality of these, each of which encodes an 
immunoreactive peptide derived from the expressed proteins. One can combine these 

: T peptides expressing sequences in all possible variations, such as one from each protein, 

rl;5 several from one or more protein and one from each of the additional proteins, a plurality 

from some and none from others, and so forth. 

Other features of the invention will be clear to the skilled artisan, and need not be 
repeated here. 

The terms and expressions which have been employed are used as terms of 
20 description and not of limitation, and there is no intention in the use of such terms and 
expressions of excluding any equivalents of the features shown and described or portions 
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thereof, it being recognized that various modifications are possible within the scope of the 
invention. 
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We Claim: 

1. An isolated nucleic acid molecule which comprises a nucleotide sequence, the 
complementary sequence of which hybridizes, under stringent conditions to at least one 
nucleic acid molecule having a nucleotide sequence set forth at SEQ ID NO: 1, 2, 3, 4 or 5. 

2. Expression vector comprising the isolated nucleic acid molecule of claim 1, 
operably linked to a promoter. 

3. Cell line or cell strain, transformed or transfected with the isolated nucleic acid 
molecule of claim 1 . 

4. Cell line or cell strain, transformed or transfected with the expression vector of 
claim 2. 

5. The isolated nucleic acid molecule of claim 1, which consists of a nucleotide 
sequences set forth at SEQ ID NO: 1, 2, 3, 4 or 5. 

6. Isolated protein encoded by the isolated nucleic acid molecule of claim 1. 

7. Method for determining colon cancer in a sample, comprising assaying said 
sample for expression of the isolated nucleic acid molecule of claim 1 , presence thereof 
being indicative of colon cancer. 
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8. Method for determining colon cancer in a sample, comprising assaying said 
sample for a protein encoded by the isolated nucleic acid molecule of claim 1, presence of 
said protein being indicative of colon cancer in said sample. 

9. The method of claim 8, comprising assaying said sample for a peptide derived 
from said protein. 

10. Composition of matter useful in treating colon cancer, comprising the isolated 
nucleic acid molecule of claim 1, and a pharmaceutically acceptable carrier. 

11. Composition of matter useful in treating colon cancer, comprising therapeutically 
effective amount of the expression vector of claim 2, and a pharmaceutically acceptable 
carrier. 

12. Composition of matter useful in treating colon cancer, comprising a 
therapeutically effective amount of the cell line or cell strain of claim 3, and having 
presented on its surface, a peptide complexed to an MHC molecule to form a complex which 
provokes an immune response against colon cancer cells. 

13. A method for treating a subject afflicted with a colon cancer or related disorder, 
comprising: 

(i) removing an immunoreactive cell containing sample from said subject, 
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(ii) contacting the immunoreactive cell containing sample to the cell line or cell strain 
of claim 3 conditions favoring production of cytolytic T cells against a peptide derived from 
the proteins encoded by the isolated nucleic acid molecule used to transfect or transform said 
cell, and 

(iii) introducing said cytolytic T cells to said subject in an amount sufficient to lyse 
said cells. 

14. A method for treating a subject afflicted with colon cancer or related condition, 
comprising: 

(i) identifying a gene expressed by cancer cells associated with said colon cancer or 
related condition, wherein said gene comprises at least one nucleotide sequence set forth in 
SEQ ID NOS: 1-5; 

(ii) identifying an MHC molecule which presents a portion of an expression product 
of said gene; 

(iii) transfecting a host cell having the same molecule as identified in (ii) with said 

gene; 

(iv) culturing said transfected cells to express said gene, and; 

(v) introducing an amount of said cells to said subject sufficient to provoke an 
immune response against said cancer cells. 

15. The method of claim 14, wherein said immune response comprises a B-cell 
response. 
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16. The method of claim 14, wherein said immune response comprises a T-cell 
response. 

17. The method of claim 15, wherein said B cell response comprises production of 
antibodies specific to said expression product or peptide derived therefrom. 

18. The method of claim 16, wherein said T-cell response comprises generation of 
cytolytic T-cell specific for cells presenting said peptide. 

19. The method of claim 14, further comprising treating said cells to render them 
non-proliferative. 

20. A method for treating a subject with a cancerous condition characterized by 
abnormal amounts of a protein, comprising: 

(i) identifying a gene expressed by said abnormal cells, comprising at least one 
nucleotide sequence set forth in SEQ ID NOS: 1-5; 

(ii) transfecting a host cell having the same MHC type as said patient expressing said 

gene; 

(iii) culturing said transfected cells to express said gene, and; 

(iv) introducing an amount of said cells to said subject sufficient to provoke an 
immune response against said cancerous condition. 
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21. The method of claim 20, further comprising treating said cells to render them 
non proliferative. 

22. A method for treating a subject with a condition characterized by abnormal 
amounts of protein encoded by a nucleic acid molecule comprising a nucleotide sequence set 
forth in SEQ ID NO: 1, 2, 3, 4 or 5, comprising administering to said subject an amount of 
a cell transfected with (i) a nucleic acid molecule which codes for said protein and (ii) a 
nucleic acid sequence which codes for an MHC molecule which presents a peptide derived 
from said protein wherein said peptide is presented by cells associated with said condition, 
sufficient to alleviate said condition. 

23. Method of claim 22, further comprising treating said cell to render it non- 
proliferative. 

24. A method for treating a subject afflicted with a condition characterized by an 
abnormal amount of protein encoded by a nucleic acid molecule comprising a nucleotide 
sequence set forth at SEQ ID NO: 1, 2, 3, 4 or 5, comprising administering to said subject 
an amount of a reagent consisting essentially of non-proliferative cells having expressed on 
its surface a peptide characteristic of said abnormal cells in an amount sufficient to elicit an 
immune response thereto. 
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25. A method for treating a subject afflicted with a condition characterized by an 
abnormal amount of a protein encoded by a nucleic acid molecule comprising a nucleotide 
sequence set forth at SEQ ID NO: 1, 2, 3, 4 or 5, comprising administering to said subject 
an antibody which specifically binds to said protein or a peptide derived therefrom, said 
antibody being coupled to a therapeutically useful agent, in an amount sufficient to treat said 
condition. 

26. A method for treating a subject afflicted with a condition characterized by 
abnormal amounts of a protein encoded by a nucleic acid molecule comprising a nucleotide 
sequence set forth at SEQ ID NO: 1, 2, 3, 4 or 5, comprising administering to said subject a 
sample of non-proliferative cells which express said protein in an amount sufficient to 
alleviate said condition. 

27. A method for preventing onset of a condition characterized by abnormal amounts 
of a protein encoded by a nucleic acid molecule comprising a nucleotide sequence set forth at 
SEQ ID NO: 1, 2, 3, 4 or 5, in a subject comprising administering an amount of a vaccine 
comprising said protein and an adjuvant in an amount sufficient to prevent onset of said 
cancerous condition in said subject. 

28. A method for preventing onset of a condition characterized by abnormal amounts 
of a protein encoded by a nucleic acid molecule comprising a nucleotide sequence set forth at 
SEQ ID NO: 1, 2, 3, 4 or 5, in a subject comprising administering an amount of a vaccine 
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comprising a peptide derived from said protein in an amount sufficient to prevent onset of 
said condition in said subject. 

29. A method for preventing onset of a condition characterized by abnormal amounts 
of a protein encoded by a nucleic acid molecule comprising a nucleotide sequence set forth at 
SEQ ID NO: 1, 2, 3, 4 or 5, in a subject comprising administering an amount of a vector 
which comprises a gene encoding said protein to a cell which is capable of expressing said 
protein or presenting a peptide derived therefrom, in an amount sufficient to prevent onset of 
said cancerous condition in said subject. 

30. A method for treating a subject afflicted with a condition characterized by 
abnormal amounts of a protein encoded by a nucleic acid molecule comprising a nucleotide 
sequence set forth at SEQ ID NO: 1, 2, 3, 4 or 5, comprising: 

(i) identifying cells from said subject which express abnormal amounts of said 
protein; 

(ii) isolating a sample of said cells; 

(iii) cultivating said cell, and 

(iv) introducing said cells to said subject in an amount sufficient to provoke an 
immune response against said cells. 

31. The method of claim 30, further comprising rendering said cells non 
proliferative, prior to introducing them to said subject. 
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32. A method for treating a subject with a condition characterized by abnormal 
amounts of at least one protein encoded by a nucleic acid molecule comprising a nucleotide 
sequence set forth in any of SEQ ID NO: 1, 2, 3, 4 or 5, comprising administering to said 
subject a vector which comprises a sequence encoding at least one immunoreactive peptide 
derived from said at least one protein, to a cell capable of expressing and presenting said at 
least one peptide, in an amount sufficient to induce an immune response in said subject. 

33. A method for following progress of a therapeutic regime designed to alleviate a 
condition characterized by abnormal expression of a protein encoded by a nucleic acid 
molecule comprising a nucleotide sequence set forth at SEQ ID NO: 1, 2, 3, 4 or 5, 
comprising: 

(a) assaying a sample from a subject to determine level of a parameter selected from 
the group consisting of (i) a peptide derived from said protein, (ii) a cytolytic T cell specific 
for cells presenting said peptide, and (iii) an antibody which specifically binds to said peptide 
of said protein, at a first time period; 

(b) assaying level of the parameter selected in (a) at a second period of time and 
comprising it to the level determined in (a) as a determination of effect of said therapeutic 
regime. 

34. A method for treating a pathological cell condition characterized by aberrant 
expression of a protein encoded by a nucleic acid molecule comprising a nucleotide sequence 
set forth at SEQ ID NO: 1, 2, 3, 4 or 5 associated with said condition, comprising 
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administering to a subject in need thereof an effective amount of either: (a) a protein 
inhibitor, or (b) an inhibitor of gene expression of said protein. 

35. The method of claim 34, wherein said protein inhibitor is an inhibiting antibody. 

36. The method of claim 34, wherein said inhibitor of gene expression is an antisense 
molecule. 

37. A composition of matter useful in stimulating an immune response to a protein 
encoded by a nucleic acid molecule comprising a nucleotide sequence set forth in SEQ ID 
NO: 1, 2, 3, 4 or 5, comprising a plurality of peptides derived from the amino acid 
sequence of said protein, wherein said peptides bind to one or more MHC molecules 
presented on the surface of cells which express an abnormal amount of said protein. 

38. The composition of matter of claim 37, wherein at least a portion of said 
plurality of peptides bind to MHC molecules and elicit a cytolytic response thereto. 

39. The composition of matter of claim 38, further comprising an adjuvant. 

40. The composition of matter of claim 39, wherein said adjuvant is a saponin, GM- 
CSF, or an interleukin. 
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41. An isolated antibody which binds to a protein encoded by a nucleic acid 
molecule which comprises the nucleotide sequence of any of SEQ ID NO: 1, 2, 3, 4 or 5. 

42. The isolated antibody of claim 41, wherein said antibody is a monoclonal 
antibody. 

43. An isolated antibody which specifically binds to a peptide derived from a protein 
encoded by a nucleic acid molecule which comprises a nucleotide sequence set forth in any 
of SEQ ID NO: 1, 2, 3, 4 or 5. 

44. The antibody of claim 43, wherein said antibody is a monoclonal antibody. 

45. An isolated antibody which specifically binds to a complex of (i) a peptide 
derived from protein encoded by a nucleic acid molecule which comprises a nucleotide 
sequence set forth in any of SEQ ID NO: 1, 2, 3, 4 or 5 and (ii) an MHC molecule to which 
said peptide complexes, but does not bind to (i) or (ii) alone. 

46. The antibody of claim 45, wherein said antibody is a monoclonal antibody. 

47. Method for determining regression, progression or onset of a condition 
characterized by abnormal levels of a protein encoded by a nucleic acid molecule comprising 
a nucleotide sequence set forth in SEQ ID NO: 1, 2, 3, 4 or 5, comprising monitoring a 
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sample from a patient with said condition for a parameter selected from the group consisting 
of (i) protein, (ii) a peptide derived from said protein and (iii) cytolytic T cells specific for a 
peptide derived from said protein and an MHC molecule to which is bound is indicative of 
progression or regression or onset of said condition. 

48. The method of claim 47, wherein said sample is a body fluid or effusion. 

49. The method of claim 47, wherein said sample is a tissue. 

50. The method of claim 47, wherein contacting said sample with an antibody which 
specifically binds with said protein or peptide. 

51. The method of claim 50, wherein said antibody is labelled with a radioactive 
label or an enzyme. 

52. The method of claim 50, wherein said antibody is a monoclonal antibody. 

53. The method of claim 47, comprising amplifying RNA which codes for said 
protein. 

54. The method of claim 53, wherein said amplifying comprises carrying out 
polymerase chain reaction. 
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55. The method of claim 47, comprising contacting said sample with a nucleic acid 
molecule which specifically hybridizes to a nucleic acid molecule which codes for or 
expresses said SCP protein. 

56. The method of claim 47, comprising assaying said sample for said peptide. 
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Abstract Of The Disclosure 

Various molecules associated with disorders such as colon cancer are disclosed. The 
invention also discloses diagnostic and therapeutic methods based upon these molecules. 
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15 <140> US 08/948,705 

<141> 1997-10-10 

% <160> 8 

■f^0 <170> FastSEQ for Windows Version 3 . 0 

■0 . <210> 1 

<211> 1552 
?J " ! <212> DNA 

1 25 <213> Homo Sapiens 



h- <400> 1 

h-* cttctggatg catccgagaa gctaaaactt acttatgagg aaaagtgtga aattgaggaa 60 

C) tcccaattga agtttttgag gaacgactta gctgaatatc agagaacttg tgaagatctt 120 

^-30 aaagagcaac taaagcataa agaatttctt ctggctgcta atacttgtaa ccgtgttggt 180 

ggtctttgtt tgaaatgtgc tcagcatgaa gctgttcttt cccaaaccca tactaatgtt 240 

catatgcaga ccatcgaaag actggttaaa gaaagagatg acttgatgtc tgcactagtt 300 

tccgtaagga gcagcttggc agatacgcag caaagagaag caagtgetta tgaacaggtg 360 

aaacaagttt tgcaaatatc tgaggaagcc aattttgaaa aaaccaaggc tttaatccag 420 

35 tgtgaccagt tgaggaagga gctggagagg caggcggagc gacttgaaaa agaacttgca 480 

tctcagcaag agaaaagggc cattgagaaa gacatgatga aaaaggaaat aacgaaagaa 540 

agggagtaca tgggatcaaa gatgttgatc ttgtctcaga atattgccca actggaggcc 600 

caggtggaaa aggttacaaa ggaaaagatt tcagctatta atcaacrgga ggaaattcaa 660 

agccagctgg cttctcggga aatggatgtc acaaaggtgt gtggagaaat gcgctatcag 720 

40 ctgaataaaa ccaacatgga gaaggatgag gcagaaaagg agcacagaga gttcagagca 780 

aaaactaaca gggatcttga aattaaagat caggaaatag agaaatcgag aatagaactg 840 

gatgaaagca aacaacactt ggaacaggag cagcagaagg cagccctggc cagagaggag 900 

tgcctgagac taacagaact gctgggcgaa tctgagcacc aactgcacct caccagatct 960 

gaaatagctc aactcagtca agaaaaaagg tatacatatg ataaattggg aaagttacag 1020 

45 agaagaaatg aagaattgga ggaacagtgt gtccagcatg ggagagtaca tgagacgatg 1080 



-2- 

aagcaaaggc taaggcagct ggataagcac agccaggcca cagcccagca gctggtgcag 1140 

ctcctcagca agcagaacca gcttctcctg gagaggcaga gcctgtcgga agaggtggac 1200 

cggctgcgga cccagttacc cagcatgcca caatctgatt gctgacctgg atggaacaga 1260 

gtgaaataaa tgaattacaa agagatattt acattcatct ggtttagact taatatgcca 1320 

5 caacgcacca cgaccttccc agggtgacac cgcctcagcc tgcagtgggg ctggtcctca 1380 

tcaacgcggg cgctgtcccc gcacgcagtc gggctggagc tggagtctga ctctagctga 1440 

gcagactcct ggtgtatgtt ttcagaaatg gcttgaagtt atgtgtttaa atctgctcat 1500 

tcgtatgcta ggttatacat atgattttca ataaatgaac tttttaaaga aa 1552 

10 <210> 2 

<211> 2885 

<212> DNA 

<213> Homo Sapiens 

15 <400> 2 

ggaattcctc ttgtcgaagt caaaggagcc cacaccaggc ggcctcaacc attccctccc 60 

acagcacccc aaatgctggg gagcccacca tgcttctttg gaccagagtt cccctcccca 120 

' z i gagcggcccc cctgggacgc ctccctccta caaactgcct ttgcctgggc cctacgacag 180 

L rS tcgagacgac ttccccctcc gcaaaacagc ctctgaaccc aacttgaaag tgcgttcaag 240 

:^p0 gctaaaacag aaggtggctg agcggagaag cagtcccctc ctgcgtcgca aggatgggac 300 

Ti tgttattagc acctttaaga agagagctgt tgagatcaca ggtgccgggc ctggggcgtc 360 

,Q . gtccgtgtgt aacagcgcac ccggctccgg ccccagctct cccaacagct cccacagcac 420 

catcgctgag aatggcttta ctggctcagt ccccaacatc cccactgaga tgctccctca 480 

gcaccgagcc ctccctctgg acagctcccc caaccagttc agcctctaca cgtctccttc 540 

zJ15 tctgcccaac atctccctag ggctgcaggc cacggtcact gtcaccaact cacacctcac 600 

^; tgcctccccg aagctgtcga cacagcagga ggccgagagg caggccctcc agtccctgcg 660 

'C- gcagggtggc acgctgaccg gcaagttcat gagcacatcc tctattcctg gctgcctgct 720 

^ gggcgtggca ctggagggcg acgggagccc ccacgggcat gcctccctgc tgcagcatgt 780 

C'! gctgttgctg gagcaggccc ggcagcagag caccctcatt gctgtgccac tccacgggca 840 

00 gtccccacta gtgacgggtg aacgtgtggc caccagcatg cggacggtag gcaagctccc 900 

gcggcatcgg cccctgagcc gcactcagtc ctcaccgctg ccgcagagtc cccaggccct 960 

gcagcagctg gtcatgcaac aacagcacca gcagttcctg gagaagcaga agcagcagca 1020 

gctacagctg ggcaagatcc tcaccaagac aggggagctg cccaggcagc ccaccaccca 1080 

ccctgaggag acagaggagg agctgacgga gcagcaggag gtcttgctgg gggagggagc 1140 

35 cctgaccatg ccccgggagg gctccacaga gagtgagagc acacaggaag acctggagga 1200 

ggaggacgag gaagaggatg gggaggagga ggaggattgc atccaggtta aggacgagga 1260 

gggcgagagt ggtgctgagg aggggcccga cttggaggag cctggtgctg gatacaaaaa 1320 

actgttctca gatgcccaac cgctgcaacc tttgcaggtg taccaagcgc ccctcagcct 1380 

ggccactgtg ccccaccaag ccctgggccg tacccaatcc tcccctgctg cccctggggg 1440 

40 catgaagaac cccccagacc aacccgtcaa gcacctcttc accacaagtg tggtctacga 1500 

cacgttcatg ctaaagcacc agtgcatgtg cgggaacaca cacgtgcacc ctgagcatgc 1560 

tggccggatc cagagcatct ggtcccggct gcaggagaca ggcctgctta gcaagtgcga 1620 

gcggatccga ggtcgcaaag ccacgctaga tgagatccag acagtgcact ctgaatacca 1680 

caccctgctc tatgggacca gtcccctcaa ccggcagaag ctagacagca agaagttgct 1740 

45 cggtcccatc agccagaaga tgtatgctgt gctgccttgt gggggcatcg gggtggacag 1800 
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tgacaccgtg tggaatgaga tgcactcctc cagtgctgtg cgcatggcag tgggctgcct 1860 

gctggagctg gccttcaagg tggctgcagg agagctcaag aatggatttg ccatcatccg 1920 

gcccccagga caccacgccg aggaatccac agccatggga ttctgcttct tcaactctgt 1980 

agccatcacc gcaaaactcc tacagcagaa gttgaacgtg ggcaaggtcc tcatcgtgga 2040 

5 ctgggacatt caccatggca atggcaccca gcaggcgttc tacaatgacc cctctgtgct 2100 

ctacatctct ctgcatcgct atgacaacgg gaacttcttt ccaggctctg gggctcctga 2160 

agaggttggt ggaggaccag gcgtggggta caatgtgaac gtggcatgga caggaggtgt 2220 

ggaccccccc attggagacg tggagtacct tacagccttc aggacagtgg tgatgcccat 2280 

tgcccacgag ttctcacctg atgtggtcct agtctccgcc gggtttgatg ctgttgaagg 2340 

10 acatctgtct cctctgggtg gctactctgt caccgccaga tgttttggcc acttgaccag 2400 

gcagctgatg accctggcag ggggccgggt ggtgctggcc ctggagggag gccatgactt 2460 

gaccgccatc tgtgatgcct ctgaagcttg tgtctcggct ctgctcagtg taaagctgca 2520 

gcccttggat gaggcagtct tgcagcaaaa gcccaacatc aacgcagtgg ccacgctaga 2580 

gaaagtcatc gagatccaga gcaaacactg gagctgtgtg cagaagttcg ccgctggtct 2640 

15 gggccggtcc ctgcgagggg cccaagcagg tgagaccgaa gaagccgaaa tgtgaacgcc 2700 

atggccttgc tgttggtggg ggccgaacag gcccaagctg cggcagcccg ggaacacagc 2760 

cccaggccgg cagaggagcc catggagcag gagcctgccc tgtgacgccc cggcccccat 2820 

■ ccctttgggc ttcaccattg tgattttgtt tattttttct attaaaaaca aaaagttaaa 2880 

I aattt 2885 



-20 



.25 



BO 



35 



40 



<210> 3 

<211> 1298 

<212> DNA 

<213> Homo Sapiens 

<220> 

<221> unsure 
<222> 55. .55 
<223> 

<220> 

<221> unsure 
<222> 141. .141 
<223> 

<220> 

<221> unsure 
<222> 199. .99 
<223> 

<220> 

<221> unsure 
<222> 342. .342 
<223> 



45 
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<220> 
, <221> unsure 
<222> 352. .352 
<223> 

5 

<220> 

<221> unsure 
<222> 722. .722 
<223> 

10 

<220> 

<221> unsure 
<222> 750. .750 
<223> 

15 

<220> 

<221> unsure 
<222> 1058. .1058 
<223> 

20 

<220> 

<221> unsure 
<222> 1101. .1101 
<223> 

25 

<220> 

<221> unsure 
<222> 1144. .1144 
<223> 

30 

<400> 3 

ggctgctgaa atgactgcga accggcttgc 
agaactagcg gatttgccaa aagactacct 
tgatggagag agaaagcatc naaagcttct 

35 taggcggaaa ttggctgana ggtctgaggc 
ttctgaagga tcaggagaaa agctggtcct 
atcttctttg gccactgtga aaaagcaact 
gttacctctg aacaaagaag agattgaacg 
cacaagtcct ctccaaatgg gaccctgtcg 

40 tttttcccct ggagaaagag gagccagcca 
ggaaggcaag aactcccctg gagcaggaaa 
cagtgacaga ccctttactg acccctgtgg 
aagaggcaaa gatgcgacga gcagagcttc 

191937.1 



agagagcctt 


ctggctttga 


gccancagga 


60 


cttgagtgag 


agtgaagatg 


agggggacaa 


120 


ggaagcaatc 


agttcccttg 


atggaaagaa 


180 


tagtctgaag 


gtgtcagagt 


tcaatgtcag 


240 


tgcagatctg 


cttgagcctg 


ttaaaacttc 


300 


gagtagagtc 


anatcaaaga 


anacagtgga 


360 


gatccacaga 


gaatagcatt 


caataaaacg 


420 


tcctgaagaa 


ccggcaggca 


gagcagctgg 


480 


ttgctcccat 


tgaacatgtg 


ctcagtggct 


540 


ttttcaacct 


cctccataag 


aacaagcagc 


600 


aaaaggcctc 


tctccgagcc 


atgagcctag 


660 


agagggctcg 


ggctctgcag 


tcctactatg 


720 
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angccaaggc tcgaagagag aagaaaatcn aaagttaaaa gtatcac^aa gtcgtgaaga 780 

aaggaaaggc caagaaagcc ctaaaagagt ttgagcagct gcggaaggtt aatccagctg 840 

ccgcactaga agaacgaaga aaagaggaaa gaaggaggag gagaaagaag aagaacaagg 900 

agaagaagaa agaagaaggg agaaggagaa gaaaagaagg agaagaggaa aaggaagaag 960 

5 gagaaagaaa aggagaagga aaaggaaaag aaggagaaga aagaagaact aagaagaagg 1020 

agaggaagaa taagaaggaa agaagaaaga aaaaagtnaa agaagaagaa agaaggaaga 1080 

aggaaagaag aggaagaact nagaagaaga aagaggagga aagaagaaag aagaataagg 1140 

aacnagaaag aaggagaaga aagaataaga agaggaagaa gaaaaagaag aaaagaagaa 1200 

ggaaagaagg agaaaaagga agaaaaaagg aagaagaaag tagaaagcgg aagaaagaaa 1260 

10 agaaagtata agaaggaaga agaagaaaga aggaaaaa 1298 

<210> 4 
<211> 2236 
<212> DNA 
15 <213> Homo Sapiens 

^ <400> 4 

S cctggcccgg tcgcggtcgc ggctctttcc agctcctggc agccgggcac ccgaaggaac 60 

iji gggtcgtgca acgacgcagc tggacctggc ccagccatgg accgaaaagt ggcccgagaa 120 

□20 ttccggcata aggtggattt tctgattgaa aatgatgcag agaaggacta tctctatgat 180 

ru gtgctgcgaa tgtaccacca gaccatggac gtggccgtgc tcgtgggaga cctgaagctg 240 

^ . gtcatcaatg aacccagccg tctgcctctg tttgatgcca ttcggccgct gatcccactg 300 

:^ aagcaccagg tggaatatga tcagctgacc ccccggcgct ccaggaagct gaaggaggtg 360 

J 1 cgtctggacc gtctgcaccc cgaaggcctc ggcctgagtg tgcgtggtgg cctggagttt 420 

q25 ggctgtgggc tcttcatctc ccacctcatc aaaggcggtc aggcagacag cgtcgggctc 480 

rtj caggtagggg acgagatcgt ccggatcaat ggatattcca tctcctcctg tacccatgag 540 

r~ gaggtcatca acctcattcg aaccaagaaa actgtgtcca tcaaagtgag acacatcggc 600 

^ ctgatccccg tgaaaagctc tcctgatgag cccctcactt ggcagtatgt ggatcagttt 660 

^ gtgtcggaat ctgggggcgt gcgaggcagc ctgggctccc ctggaaatcg ggaaaacaag 720 

: ^"30 gagaagaagg tcttcatcag cctggtaggc tcccgaggcc ttggctgcag catttccagc 780 

ggccccatcc agaagcctgg catctttatc agccatgtga aacctggctc cctgtctgct 840 

gaggtgggat tggagatagg ggaccagatt gtcgaagtca atggcgtcga cttctctaac 900 

ctggatcaca aggaggctgt aaatgtgctg aaaaatagcc gcagcctgac catctccatt 960 

gtagctgcag ctggccggga gctgttcatg acagaccggg agcggctggc agaggcgcgg 1020 

35 cagcgtgagc tgcagcggca ggagcttctc atgcagaagc ggctggcgat ggagtccaac 1080 

aagatcctcc aggagcagca ggagatggag cggcaaagga gaaaagaaat tgcccagaag 1140 

gcagcagagg aaaatgagag ataccggaag gagatggaac agattgtaga ggaggaagag 1200 

aagtttaaga agcaatggga agaagactgg ggctcaaagg aacagctact cttgcctaaa 1260 

accatcactg ctgaggtaca cccagtaccc cttcgcaagc caaagtatga tcagggagtg 1320 

40 gaacctgagc tcgagcccgc agatgacctg gatggaggca cggaggagca gggagagcag 1380 

gatttccgga aatatgagga aggctttgac ccctactcta tgttcacccc agagcagatc 1440 

atggggaagg atgtccggct cctacgcatc aagaaggagg gatccttaga cctggccctg 1500 

gaaggcggtg tggactcccc cattgggaag gtggtcgttt ctgctgtgta tgagcgggga 1560 

191937.1 
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gctgctgagc ggcatggtgg cattgtgaaa 
attgtgacag actacaccct ggctgaggct 
ggcggggact ggatcgacct tgtggttgcc 
ctgaccttct tgctgaagtc caaaagggga 
5 ctccggcccc acctcgtgaa cacaaagcct 
cacaccagat ggcatccttg ggacctgaat 
gccctgaacc agggccagat aaggaacagc 
ggaaagggag cagccagccg tttgggagaa 
cctctggccc agtgaatttg gtctctccca 
10 taagacccca ctggagtctc tctctctcca 
tgccaggatt gtcactccaa accttactct 
ttccagctta aaaaaa 

<210> 5 
15 <211> 2162 

<212> DNA 
<213> Homo Sapiens 

<400> 5 

20 cctggcccgg tcgcggtcgc ggctctttcc 
gggtcgtgca acgacgcagc tggacctggc 
ttccggcata aggtggattt tctgattgaa 
gtgctgcgaa tgtaccacca gaccatggac 
gtcatcaatg aacccagccg tctgcctctg 

25 aagcaccagg tggaatatga tcagctgacc 
cgtctggacc gtctgcaccc cgaaggcctc 
ggctgtgggc tcttcatctc ccacctcatc 
caggtagggg acgagatcgt ccggatcaat 
gaggtcatca acctcattcg aaccaagaaa 

30 ctgatccccg tgaaaagctc tcctgatgag 
gtgtcggaat ctgggggcgt gcgaggcagc 
gagaagaagg tcttcatcag cctggtaggc 
ggccccatcc agaagcctgg catctttatc 
gaggtgggat tggagatagg ggaccagatt 

35 ctggatcaca aggaggctgt aaatgtgctg 
gtagctgcag ctggccggga gctgttcatg 
cagcgtgagc tgcagcggca ggagcttctc 
aagatcctcc aggagcagca ggagatggag 
gcagcagagg aaaatgagag ataccggaag 

40 aagtttaaga agcaatggga agaagactgg 
accatcactg ctgaggtaca cccagtaccc 
tgaggaaggc tttgacccct actctatgtt 
ccggctccta cgcatcaaga aggagggatc 

191937.1 



ggggacgaga 


tcatggc^at 


caacggcaag 


XbzU 


gacgcxgccc 


ugcagaaggc 


ccggaaucag 


XboU 


gucugccccc 


c aaaggagx. a 


ugacgaugag 




dciC C ddd L. t- C 


dcycyuudyg 


dddc dg c y dy 


lOUU 


y y y 


-I— f~ rrci rra rra <TrT 

L- t_y dy dy dy y 


L-t—dL-cl L.y dLd 


lODU 


ctatcaccca 


ggaatctcaa 


actccctttg 


1920 


tcgggccact 


tttttgaagg 


ccaatgtgga 


1980 


gatctcaagg 


atccagactc 


tcattccttt 


2040 


gctttggggg 


actccttcct 


tgaaccctaa 


2100 


tccctctcct 


ctgccctctg 


ctctaattgc 


2160 


gagctcatta 


ataaaataaa 


cagatttatt 


2220 
2236 



agctcctggc 


agccgggcac 


ccgaaggaac 


60 


ccagccatgg 


accgaaaagt 


ggcccgagaa 


120 


aatgatgcag 


agaaggacta 


tctctatgat 


180 


gtggccgtgc 


tcgtgggaga 


cctgaagctg 


240 


tttgatgcca 


ttcggccgct 


gatcccactg 


300 


ccccggcgct 


ccaggaagct 


gaaggaggtg 


360 


ggcctgagtg 


tgcgtggtgg 


cctggagttt 


420 


aaaggcggtc 


aggcagacag 


cgtcgggctc 


480 


ggatattcca 


tctcctcctg 


tacccatgag 


540 


actgtgtcca 


tcaaagtgag 


acacatcggc 


600 


cccctcactt 


ggcagtatgt 


ggatcagttt 


660 


ctgggctccc 


ctggaaatcg 


ggaaaacaag 


720 


tcccgaggcc 


ttggctgcag 


catttccagc 


780 


agccatgtga 


aacctggctc 


cctgtctgct 


840 


gtcgaagtca 


atggcgtcga 


cttctctaac 


900 


aaaaatagcc 


gcagcctgac 


catctccatt 


960 


acagaccggg 


agcggctggc 


agaggcgcgg 


1020 


atgcagaagc 


ggctggcgat 


ggagtccaac 


1080 


cggcaaagga 


gaaaagaaat 


tgcccagaag 


1140 


gagatggaac 


agattgtaga 


ggaggaagag 


1200 


ggctcaaagg 


aacagctact 


cttgcctaaa 


1260 


cttcgcaagc 


caaagtgatt 


tccggaaata 


1320 


caccccagag 


cagatcatgg 


ggaaggatgt 


1380 


cttagacctg 


gccctggaag 


gcggtgtgga 


1440 
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ctcccccatt gggaaggtgg tcgtttctgc tgtgtatgag cggggaggtg ctgagcggca 1500 

tggtggcatt gtgaaagggg acgagatcat ggcaatcaac ggcaagattg tgacagacta 1560 

caccctggct gaggctgacg ctgccctgca gaaggcctgg aatcagggcg gggactggat 1620 

cgaccttgtg gttgccgtct gccccccaaa ggagtatgac gatgagctga ccttcttgct 1680 

5 gaagtccaaa aggggaaacc aaattcacgc gttaggaaac agtgagctcc ggccccacct 1740 

cgtgaacaca aagcctcgga ccagccttga gagaggccac atgacacaca ccagatggca 1800 

tccttgggac ctgaatctat cacccaggaa tctcaaactc cctttggccc tgaaccaggg 1860 

ccagataagg aacagctcgg gccacttttt tgaaggccaa tgtggaggaa agggagcagc 1920 

cagccgtttg ggagaagatc tcaaggatcc agactctcat tcctttcctc tggcccagtg 1980 

10 aatttggtct ctcccagctt tgggggactc cttccttgaa ccctaataag accccactgg 2040 

agtctctctc tctccatccc tctcctctgc cctctgctct aattgctgcc aggattgtca 2100 

ctccaaacct tactctgagc tcattaataa aataaacaga tttattttcc agcttaaaaa 2160 

aa 2162 



15 <210> 6 

<211> 1789 
r <212> DN& 

<213> Homo Sapiens 

C-20 <400> 6 

cttctggatg catccgagaa gctaaaactt acttatgagg aaaagtgtga aattgaggaa 60 

r=" . tcccaattga agtttttgag gaacgactta gctgaatatc agagaacttg tgaagatctt 120 

Z aaagagcaac taaagcataa agaatttctt ctggctgcta atacttgtaa ccgtgttggt 180 

ggtctttgtt tgaaatgtgc tcagcatgaa gctgttcttt cccaaaccca tactaatgtt 240 

E -=25 catatgcaga ccatcgaaag actggttaaa gaaagagatg acttgatgtc tgcactagtt 300 

p ;; ; tccgtaagga gcagcttggc agatacgcag caaagagaag caagtgctta tgaacaggtg 360 

H ; aaacaagttt tgcaaatatc tgaggaagcc aattttgaaa aaaccaaggc tttaatccag 420 

k tgtgaccagt tgaggaagga gctggagagg caggcggagc gacttgaaaa agaacttgca 480 

tctcagcaag agaaaagggc cattgagaaa gacatgatga aaaaggaaat aacgaaagaa 540 

^30 agggagtaca tgggatcaaa gatgttgatc ttgtctcaga atattgccca actggaggcc 600 

caggtggaaa aggttacaaa ggaaaagatt tcagctatta atcaactgga ggaaattcaa 660 

agccagctgg cttctcggga aatggatgtc acaaaggtgt gtggagaaat gcgctatcag 720 

ctgaataaaa ccaacatgga gaaggatgag gcagaaaagg agcacagaga gttcagagca 780 

aaaactaaca gggatcttga aattaaagat caggaaatag agaaattgag aatagaactg 840 

35 gatgaaagca aacaacactt ggaacaggag cagcagaagg cagccctggc cagagaggag 900 

tgcctgagac taacagaact gctgggcgaa tctgagcacc aactgcacct caccagacag 960 

gaaaaagata gcattcagca gagctttagc aaggaagcaa aggcccaagc ccttcaggcc 1020 

cagcaaagag agcaggagct gacacagaag atacagcaaa tggaagccca gcatgacaaa 1080 

actgaaaatg aacagtattt gttgctgacc tcccagaata catttttgac aaagttaaag 1140 

40 gaagaatgct gtacattagc caagaaactg gaacaaatct ctcaaaaaac cagatctgaa 1200 

atagctcaac tcagtcaaga aaaaaggtat acatatgata aattgggaaa gttacagaga 1260 

agaaatgaag aattggagga acagtgtgtc cagcatggga gagtacatga gacgatgaag 1320 

caaaggctaa ggcagctgga taagcacagc caggccacag cccagcagct ggtgcagctc 1380 
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ctcagcaagc agaaccagct tctcctggag aggcagagcc tgtcgga^Lga ggtggaccgg 1440 

ctgcggaccc agttacccag catgccacaa tctgattgct gacctggatg gaacagagtg 1500 

aaataaatga attacaaaga gatatttaca ttcatctggt ttagacttaa tatgccacaa 1560 

cgcaccacga ccttcccagg gtgacaccgc ctcagcctgc agtggggctg gtcctcatca 1620 

5 acgcgggcgc tgtccccgca cgcagtcggg ctggagctgg agtctgactc tagctgagca 1680 

gactcctggt gtatgttttc agaaatggct tgaagttatg tgtttaaatc tgctcattcg 1740 

tatgctaggt tatacatatg attttcaata aatgaacttt ttaaagaaa 1789 

<210> 7 
10 <211> 1306 

<212> DNA 
<213> Homo Sapiens 



<400> 7 

15 aaaaatagcc gcagcctgac catctccatt 
acagaccggg agcggctggc agaggcgcgg 
atgcagaagc ggctggcgat ggagtccaac 
cggcaaagga gaaaagaaat tgcccagaag 
gagatggaac agattgtaga ggaggaagag 

20 ggctcaaagg aacagctact cttgcctaaa 
cttcgcaagc caaagtatga tcagggagtg 
gatggaggca cggaggagca gggagagcag 
ccctactcta tgttcacccc agagcagatc 
aagaaggagg gatccttaga cctggccctg 

25 gtggtcgttt ctgctgtgta tgagcgggga 
ggggacgaga tcatggcaat caacggcaag 
gacgctgccc tgcagaaggc ctggaatcag 
gtctgccccc caaaggagta tgacgatgag 
aaccaaattc acgcgttagg aaacagtgag 

30 cggaccagcc ttgagagagg ccacatgaca 
ctatcaccca ggaatctcaa actccctttg 
tcgggccact tttttgaagg ccaatgtgga 
gatctcaagg atccagactc tcattccttt 
gctttggggg actccttcct tgaaccctaa 

35 tccctctcct ctgccctctg ctctaattgc 
gagctcatta ataaaataaa cagatttatt 



gtagctgcag ctggccggga gctgttcatg 60 

cagcgtgagc tgcagcggca ggagcttctc 120 

aagatcctcc aggagcagca ggagatggag 180 

gcagcagagg aaaatgagag ataccggaag 240 

aagtttaaga agcaatggga agaagactgg 300 

accatcactg ctgaggtaca cccagtaccc 360 

gaacctgagc tcgagcccgc agatgacctg 420 

gatttccgga aatatgagga aggctttgac 480 

atggggaagg atgtccggct cctacgcatc 540 

gaaggcggtg tggactcccc cattgggaag 600 

gctgctgagc ggcatggtgg cattgtgaaa 660 

attgtgacag actacaccct ggctgaggct 720 

ggcggggact ggatcgacct tgtggttgcc 780 

ctgaccttct tgctgaagtc caaaagggga 840 

ctccggcccc acctcgtgaa cacaaagcct 900 

cacaccagat ggcatccttg ggacctgaat 960 

gccctgaacc agggccagat aaggaacagc 1020 

ggaaagggag cagccagccg tttgggagaa 1080 

cctctggccc agtgaatttg gtctctccca 1140 

taagacccca ctggagtctc tctctctcca 1200 

tgccaggatt gtcactccaa accttactct 1260 

ttccagctta aaaaaa 1306 



<210> 8 

<211> 2289 

40 <212> DNA. 

<213> Homo Sapiens 



<400> 8 
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cctggcccgg tcgcggtcgc ggctctttcc 
gggtcgtgca acgacgcagc tcjgacctggc 
ttccggcata aggtggattt tctgattgaa 
gtgctgcgaa tgtaccacca gaccatggac 

5 gtcatcaatg aacccagccg tctgcctctg 
aagcaccagg tggaatatga tcagctgacc 
cgtctggacc gtctgcaccc cgaaggcctc 
ggctgtgggc tcttcatctc ccacctcatc 
caggtagggg acgagatcgt ccggatcaat 

10 gaggtcatca acctcattcg aaccaagaaa 
ctgatccccg tgaaaagctc tcctgatgag 
gtgtcggaat ctgggggcgt gcgaggcagc 
gagaagaagg tcttcatcag cctggtaggc 
ggccccatcc agaagcctgg catctttatc 

15 gaggtgggat tggagatagg ggaccagatt 
ctggatcaca aggaggctgt aaatgtgctg 
gtagctgcag ctggccggga gctgttcatg 
cagcgtgagc tgcagcggca ggagcttctc 
aagatcctcc aggagcagca ggagatggag 

20 gcagcagagg aaaatgagag ataccggaag 
aagtttaaga agcaatggga agaagactgg 
accatcactg ctgaggtaca cccagtaccc 
gaacctgagc tcgagcccgc agatgacctg 
ccacaggaga tgttgaagag gatggtggtt 

25 ggaaatatga ggaaggcttt gacccctact 
aggatgtccg gctcctacgc atcaagaagg 
gtgtggactc ccccattggg aaggtggtcg 
agcggcatgg tggcattgtg aaaggggacg 
cagactacac cctggctgag gctgacgctg 

30 actggatcga ccttgtggtt gccgtctgcc 
tcttgctgaa gtccaaaagg ggaaaccaaa 
cccacctcgt gaacacaaag cctcggacca 
gatggcatcc ttgggacctg aatctatcac 
accagggcca gataaggaac agctcgggcc 

35 gagcagccag ccgtttggga gaagatctca 
cccagtgaat ttggtctctc ccagctttgg 
ccactggagt ctctctctct ccatccctct 
attgtcactc caaaccttac tctgagctca 
ttaaaaaaa 

40 



agctcctggc agccgggcac ccgaaggaac 60 

ccagccatgg accgaaaagt ggcccgagaa 120 

aatgatgcag agaaggacta tctctatgat 180 

gtggccgtgc tcgtgggaga cctgaagctg 240 

tttgstgcca ttcggccgct gatcccactg 300 

ccccggcgct ccaggaagct gaaggaggtg 360 

ggcctgagtg tgcgtggtgg cctggagttt 420 

aaaggcggtc aggcagacag cgtcgggctc 480 

ggatattcca tctcctcctg tacccatgag 540 

actgtgtcca tcaaagtgag acacatcggc 600 

cccctcactt ggcagtatgt ggatcagttt 660 

ctgggctccc ctggaaatcg ggaaaacaag 720 

tcccgaggcc ttggctgcag catttccagc 780 

agccatgtga aacctggctc cctgtctgct 840 

gtcgaagtca atggcgtcga cttctctaac 900 

aaaaatagcc gcagcctgac catctccatt 960 

acagaccggg agcggctggc agaggcgcgg 1020 

atgcagaagc ggctggcgat ggagtccaac 1080 

cggcaaagga gaaaagaaat tgcccagaag 1140 

gagatggaac agattgtaga ggaggaagag 1200 

ggctcaaagg aacagctact cttgcctaaa 1260 

cttcgcaagc caaagtatga tcagggagtg 1320 

gatggaggca cggaggagca gggagagcag 1380 

tatcaagaca gcattcaaga caagatttcc 1440 

ctatgttcac cccagagcag atcatgggga 1500 

agggatcctt agacctggcc ctggaaggcg 1560 

tttctgctgt gtatgagcgg ggagctgctg 1620 

agatcatggc aatcaacggc aagattgtga 1680 

ccctgcagaa ggcctggaat cagggcgggg 1740 

ccccaaagga gtatgacgat gagctgacct 1800 

ttcacgcgtt aggaaacagt gagctccggc 1860 

gccttgagag aggccacatg acacacacca 1920 

ccaggaatct caaactccct ttggccctga 1980 

acttttttga aggccaatgt ggaggaaagg 2040 

aggatccaga ctctcattcc tttcctctgg 2100 

gggactcctt ccttgaaccc taataagacc 2160 

cctctgccct ctgctctaat tgctgccagg 2220 

ttaataaaat aaacagattt attttccagc 2280 

2289 
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